168

13

Useful Algorithms

13.5

The Maximum Entropy Method

Consider the problem of deducing the positions of stars and galaxies from a noisy map

of electromagnetic radiation intensity. One should have an estimate for the average

noise level: The simple treatment of such a map is to reject every feature greater than

the mean noise level and accept every one that is greater. Such a map is likely to be

a considerably distorted version of reality. 12

The maximum entropy method can be considered as a heuristic drill for applying

D. Bernoulli (1777) maxim: “Of all the innumerable ways of dealing with errors of

observation, one should choose the one which has the highest degree of probability

for the complex of observations as a whole”. In effect, it is a generalization of the

method of maximum likelihood.

First, the experimental map must be digitized both spatially and with respect to

intensity; that is, it is encoded as a finite set of pixels, each of which may assume

one of a finite number of density levels. Let that density be m Subscript jm j at the j jth pixel.

Then random maps are generated and compared with the data. All those inconsis-

tent with the data (with due regard to the observational errors) are rejected. The

commonest map remaining is then the most likely representation. 13 This process

is the constrained maximization of the configurational entropy minus sigma summation m Subscript j Baseline log m Subscript jΣ m j log m j (the

unconstrained maximization would simply lead to a uniform distribution of density

over the pixels). Maximum entropy image restoration yields maximum information

in Shannon’s sense. 14

References

Bernoulli D (1777) Diiudicatio maxime probabilis plurium observationem discrepantium atque

verisimillima inductio inde formanda. Acta Acad Sci Imp Petrop 1:3–23

Buck B, Macaulay VA (eds) (1991) Maximum entropy in action. Clarendon Press, Oxford

Good IJ (1962) Botryological speculations. In: Good IJ (ed) The scientist speculates. Heinemann,

London, pp 120–132

Gorban AN, Popova TG, Zinovyev A (2005) Codon usage trajectories and 7-cluster structure of

143 complete bacterial genomic sequences. Phys A 353:365–387

Graps A (1995) An introduction to wavelets. IEEE Comput Sci Eng 2:50–61

Gull SF, Daniell GJ (1978) Image reconstruction from incomplete and noisy data. Nat (Lond)

272:686–690

Kendall DG (1970) A mathematical approach to seriation. Philos Trans R Soc A 269:125–135

Kruskal JB (1964) Multidimensional scaling by optimizing goodness of fit to a nonmetric hypoth-

esis. Psychom 29:1–27

Skilling J, Bryan RK (1984) Maximum entropy image reconstruction: general algorithm. Mon Not

R Astron Soc 211:111–124

12 Implicitly, Platonic reality is meant here.

13 Gull and Daniell (1978).

14 See also Skilling and Bryan (1984); Buck and Macaulay (1991).